Search CORE

194 research outputs found

Multilingual Models for Compositional Distributed Semantics

Author: Blunsom Phil
Hermann Karl Moritz
Publication venue
Publication date: 01/01/2014
Field of study

We present a novel technique for learning semantic representations, which extends the distributional hypothesis to multilingual data and joint-space embeddings. Our models leverage parallel data and learn to strongly align the embeddings of semantically equivalent sentences, while maintaining sufficient distance between those of dissimilar sentences. The models do not rely on word alignments or any syntactic information and are successfully applied to a number of diverse languages. We extend our approach to learn semantic representations at the document level, too. We evaluate these models on two cross-lingual document classification tasks, outperforming the prior state of the art. Through qualitative analysis and the study of pivoting effects we demonstrate that our representations are semantically plausible and can capture semantic relationships across languages without parallel data.Comment: Proceedings of ACL 2014 (Long papers

arXiv.org e-Print Archive

Crossref

Distributed Representations for Compositional Semantics

Author: Hermann Karl Moritz
Publication venue
Publication date: 01/01/2014
Field of study

The mathematical representation of semantics is a key issue for Natural Language Processing (NLP). A lot of research has been devoted to finding ways of representing the semantics of individual words in vector spaces. Distributional approaches --- meaning distributed representations that exploit co-occurrence statistics of large corpora --- have proved popular and successful across a number of tasks. However, natural language usually comes in structures beyond the word level, with meaning arising not only from the individual words but also the structure they are contained in at the phrasal or sentential level. Modelling the compositional process by which the meaning of an utterance arises from the meaning of its parts is an equally fundamental task of NLP. This dissertation explores methods for learning distributed semantic representations and models for composing these into representations for larger linguistic units. Our underlying hypothesis is that neural models are a suitable vehicle for learning semantically rich representations and that such representations in turn are suitable vehicles for solving important tasks in natural language processing. The contribution of this thesis is a thorough evaluation of our hypothesis, as part of which we introduce several new approaches to representation learning and compositional semantics, as well as multiple state-of-the-art models which apply distributed semantic representations to various tasks in NLP.Comment: DPhil Thesis, University of Oxford, Submitted and accepted in 201

arXiv.org e-Print Archive

Oxford University Research Archive

"Not not bad" is not "bad": A distributional account of negation

Author: Blunsom Phil
Grefenstette Edward
Hermann Karl Moritz
Publication venue
Publication date: 01/01/2013
Field of study

With the increasing empirical success of distributional models of compositional semantics, it is timely to consider the types of textual logic that such models are capable of capturing. In this paper, we address shortcomings in the ability of current models to capture logical operations such as negation. As a solution we propose a tripartite formulation for a continuous vector space representation of semantics and subsequently use this representation to develop a formal compositional notion of negation within such models.Comment: 9 pages, to appear in Proceedings of the 2013 Workshop on Continuous Vector Space Models and their Compositionalit

arXiv.org e-Print Archive

CiteSeerX

Oxford University Research Archive

Learning Bilingual Word Representations by Marginalizing Alignments

Author: Blunsom Phil
Hermann Karl Moritz
Kočiský Tomáš
Publication venue
Publication date: 01/01/2014
Field of study

We present a probabilistic model that simultaneously learns alignments and distributed representations for bilingual data. By marginalizing over word alignments the model captures a larger semantic context than prior work relying on hard alignments. The advantage of this approach is demonstrated in a cross-lingual classification task, where we outperform the prior published state of the art.Comment: Proceedings of ACL 2014 (Short Papers

arXiv.org e-Print Archive

Crossref

Oxford University Research Archive

A Deep Architecture for Semantic Parsing

Author: Blunsom Phil
de Freitas Nando
Grefenstette Edward
Hermann Karl Moritz
Publication venue
Publication date: 01/01/2014
Field of study

Many successful approaches to semantic parsing build on top of the syntactic analysis of text, and make use of distributional representations or statistical models to match parses to ontology-specific queries. This paper presents a novel deep learning architecture which provides a semantic parsing system through the union of two neural models of language semantics. It allows for the generation of ontology-specific queries from natural language statements and questions without the need for parsing, which makes it especially suitable to grammatically malformed or syntactically atypical text, such as tweets, as well as permitting the development of semantic parsers for resource-poor languages.Comment: In Proceedings of the Semantic Parsing Workshop at ACL 2014 (forthcoming

arXiv.org e-Print Archive

CiteSeerX

Crossref

Oxford University Research Archive

Teaching Machines to Read and Comprehend

Author: Blunsom Phil
Espeholt Lasse
Grefenstette Edward
Hermann Karl Moritz
Kay Will
Kočiský Tomáš
Suleyman Mustafa
Publication venue
Publication date: 19/11/2015
Field of study

Teaching machines to read natural language documents remains an elusive challenge. Machine reading systems can be tested on their ability to answer questions posed on the contents of documents that they have seen, but until now large scale training and test datasets have been missing for this type of evaluation. In this work we define a new methodology that resolves this bottleneck and provides large scale supervised reading comprehension data. This allows us to develop a class of attention based deep neural networks that learn to read real documents and answer complex questions with minimal prior knowledge of language structure.Comment: Appears in: Advances in Neural Information Processing Systems 28 (NIPS 2015). 14 pages, 13 figure

arXiv.org e-Print Archive

Oxford University Research Archive

Emergence of Linguistic Communication from Referential Games with Symbolic and Pixel Input

Author: Clark Stephen
Hermann Karl Moritz
Lazaridou Angeliki
Tuyls Karl
Publication venue
Publication date: 11/04/2018
Field of study

The ability of algorithms to evolve or learn (compositional) communication protocols has traditionally been studied in the language evolution literature through the use of emergent communication tasks. Here we scale up this research by using contemporary deep learning methods and by training reinforcement-learning neural network agents on referential communication games. We extend previous work, in which agents were trained in symbolic environments, by developing agents which are able to learn from raw pixel data, a more challenging and realistic input representation. We find that the degree of structure found in the input data affects the nature of the emerged protocols, and thereby corroborate the hypothesis that structured compositional language is most likely to emerge when agents perceive the world as being structured

arXiv.org e-Print Archive

University of Liverpool Repository